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SUMMARY 

Physical training (PT) is an 'important component of Marine Corps basic train- 
ing (BT) . The primary PT objective is* to improve recruit fitness to prepare the 
men for the rigors of the Fleet Marine Force (FMF). However* because PT is part 
of an integrated program designed to achieve a variety of outcomes, it may be 
important to consider possible side effects of PT on nonfitness outcomes when 
designing or evaluating PT programs. 

Side effects of PT were investigated by relating naturally occurring platoon 
differenced in fitness outcome^ to (a) 'Esprit de Corps (measured by attitudes 
toward the Marine Corps and perceptions „of BT leadership and group cohesion) (b) 
improved self-confidence (measured by perceived self-improvement during BT) , (c) 
acquisition of military skills and knowledge (measured by BT tests and Drill 
Instructor ratings) , (d) BT health (measured by illness incidents requiring 1 or 
more dispensary visits), (e) * BT attrition, and (f) FMF success -(measured by 
attrition and promotion rates following BT) . The data for the analyses were 
originally collected as part of three BT stress studies carried out between 1978 
and 1981. ^ 

The scores from the standard physical fitness test administered at the end of 
the first two weeks of BT and again in the tenth week were employed to determine 
fitness improvement for individual recruits. Platoons were classified as low or 
high improvement based on the average improvement of the platopn members. ~ Both 
simple difference scores and residualized gain scores, were used to estimate im- 
provement to minimize the risk of erroneous classifications resulting from statis- 
tical limitations of either score used alone. The relationships between PT 
improvement and the outcomes described above were determined by t- tests between 
the two groups and contingency table analysis* 

The high improvement group showed (a) more positive attitudes toward the 
Marine Corps and Marine Corps philosophy (i.e., greater acceptance of authority, 
higher commitment to doing well in the Marine Corps, and greater general satis- 
faction with the Marine Corps); (b) greater feelings of self-improvement during 
BT; and C (c) better performance during BT, Another positive correlate of high 
improvement was that recruits in this group described their Drill Instructors as 
better examples of what a marine should be. A possible negative effect of PT was 
higher attrition in the high improvement group; the most consistent element of the 
attrition pattern was higher medical attrition in the high improvement group. 
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The study objective was to determine whether PT might have aide effects which 
should be taken into account when designing PT programs. The significant associ- 

* • 

ations described above show that PT side ^ffects may be important. Additional 
research is needed to determine whether PT actually causes the nonfitness outcomes 
or is coi...*ated with those outcomes for some other reason, whether the side 
effects are large enough to be of practical concern, and what specific aspects of 
PT programs affect which cjtcomes. This information would provide a basis for 
guidelines to design PT programs to enhance the overall effects of PTl ^ > 
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INTRODUCTION 

Improving the physical f itnes^ of recruits to prepare t:hem for the physical 
rigors of Fleet Marine Force (hereafter , FHF) activities is one objective of 
Marine Corps basic training, (hereafter, BT) (1, p. 1-1) . The success of the cur- 
rent physical training , program/ (hereafter, P?) in evident; average scores on a 
standard physical t .tness test increase by 25% during BT (2). Even though this' 
improvement indicates success in achieving the primary PT goal, therein* M times 
when program modif icationsy'are' considered. For example, the prog? gui be 

restructured if changes w<j(ild make more time available for higher priority train- 
ing activities or would jftake BT graduates better prepared for the specific phys- 
ical requirements of t/e PMP. The present report examines six possible sifle 
effects of PT* which /may deserve attention when considering possible program 
modifications* / & 

The side effect concept for PT is borrowed from medical terminology • In 

medical terms, a side effect is a physical or psychological reaction to treatment 

which is not the /pr Lmary or intended treatment goal. Because recruit PT is part 

of an integrated 0 program designed to achieve multiple BT objectives, a pt program 

may produce side effects by Influencing nonfitness BT outcomes, including 

recruits 1 acquisition of self-discipline, military skills, marksmanship, military 

bearing, and Esprit de Corps (1, p. 1-1). J 

a 

Incidental data analyses in our earlier work on the effects of psychological 
stress in BT (2) showed substantial platoon differences in titndss at , the end of 
BT. These differences occurred despite the fact that the PT program was highly 
structured. However, training procedures permit some augmentation of the standard 
program at the discretion of the trainin" personnel (up to carefully specified 
limits) . The observed platoon differences implied that differential use of this 
discretionary latitude resulted in significantly different PT programs in 
different platoons. 

The naturally occurring platoon fitness differenced provided the opportunity 
to explore possible side effects of PT programs. This report therefore examines 
ho,w platoons which differed in fitness improvement during BT fared with respect to* 
other BT outcomes. Platoons with relatively large fitness improvements are com- 
pared to platoons with relatively small improvements to test six hypotheses: 



Hypothesis It- High fitness improvement will be associated with gre ater Esprist de 
Corps . 

Difficult initiations produce positive attitudes toward the organization 
joined (3-5) , so high improvement should be associated with more positive atti- 
tudes toward the. Marine Corps at the end of BT. This basic hypothesis was 
extended to include * increased group cohesion and more positive perceptions of 
leadership because positive evaluation may generalize -o perceptions of other 
members of the organization. „< • « 



/ 



hypothesis 2: High 1 fitness improvement will be associated with create* p erceived 
/ ^ self-improvement in recruits . , 

Interviews with recruits suggested that feelings of self-improvement arise 
from mastering significant BT challenges. . PT is one of the challenges that 
recruits must master, so more difficult PT should produce greater gains in self- 
esteem. If improved physical capabilities contribute to improved self-esteem,, 
high Improvement PT programs can promote the BT objective of developing recruits 
self-reliance and confidence (1, p. 2-2) . . 
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Hypothesis 3: Fitness improvement will be related to BT performance . 

This hypothesis concerns the BT objectives pf developing military skills, 
marksmanship, and military bearing. These performance variables are focal points 
for the training program (1, p. 1-1), but two conflicting possibilities appeared 
reasonable. Programs that produce larger than average improvement could produce 
better performance as part of a general striving for excellence. Howeve 1 . , if 
fitness training is emphasized at the expense of other aspects of training this 
emphasis would impair performance in other areas. ■ 



The three remaining- hypotheses concern possible side effects which were not 

> o 

explicit BT objectives, but which are logically related to those objectives. In 
contrast to the first three hypotheses which all reflect potential, positive side 
effects, two of the remaining hypotheses concern possible negative side effects. 



Hypothesis 4: High fitness improvement will be associated with poorer health 
during BT . v 

More demanding physical programs probably .will produce more sprains, strains, 
and other minor injuries because the increased Remands push more recruits to their 
physical limits and beyond . A more speculative possibility is that heavy physical 
demands will reduce resistance to infectious diseases which are common in BT 
(6,7). These possible effects are important because illness is inherently un- 
desirable and likely to impede progress toward stated BT objectives. 



Hypothesis 5: High improvement will be associated with higher attrition . 

This hypothesis assumes that high improvement programs will cause more in- 
iuries, exacerbate more old medical problems, and/or cause more recruits to give 
up because they feel they cannot meet the strict, program requirements. The first 
two possibilities would lead to higher rates Sf medical attrition and the third 



po 



ssibility to increased attrition for behavioral cau&es. 
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Hypothesis 6j Graduates from the high improvement group Will" hav e greater FMP 

c success" than those from the low improvement gro up. 

All other things being equal, PMP tuccess implies- that a recruit satisfied 
the BT objective of learning to maintain or improve tho«e traits that distinguish 
a Marine (i, p. 1-1). However, the usefulness of FMP success as a general crite- 
rion of Marine Corps success is limited by the qualification that PMP conditions 
are probably important contributors to this outcome. Thus, this hypothesized 
associ-ition would be expected to be weak. 4 



METHOD 

Samples^ , 

Data from three studies originally designed to assess the effects of psych- 
ological stresses on BT outcomes were employed to test the hypotheses outlined 
above. Descriptive statistics for'the recruit samples in each study are provided 
in Table 1. «. 



TABLE I 

DESCRIPTION OF RECRUlf COHORTS AND STUDY DESIGNS 

i 

RACE 0 



STUDY 


SAMPLE 


AGE 


EDUCATION 


GCT 


w 


B 


H 


O 


1 


413 


18.7 


11.4 


104.1 


73.9% 


11.1% 


■ o 

11.4% 


3.7% 


2(a)* 


2360 


19.4 


11.5 . 


103.2 


••• 


••* 


•*•' 




2(b) 


425 


19.1 


11.6 


104.8 


£6.9% 


15.3% 


14.1% 


4.6% 


3 


2648 


18.9 


11.7 


104.1 


70.7% 


16.3% 


7.7% 


5.5% 



fl Race groups were W = White, B » Slack, H » Hispanic, and 0 = Other. Penantages were based on responses to a questionnaire item 
^ concerning race. Nor.respondents comprised 12.8% of the Study 1 sample, 13.7% o{ Study 2, and 2.4% of Study 3. The first two 
studies explicitly noted that providing this information was optional; the third study made no comment regarding this specific item. 
^Iri Stud. >nly the subset of 425 recruits completed the questionnaires. Other data were obtained for the full samplo. 
* 'Respondents in Study ?.{a) did not complete the background questionnaire that provided race information. 
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■ • . N 

Physical Training Improvement Classifications 

% " * *\ 

Physical fitness scorer** were the. results of a standard BT physical . fitness 

test consisting of 'pull-ups, sit-ups, and a 3-mile run. Scores* on each test com- 

ponent could range from 0 to 100 for a total possible score of 300. . „ These tests 

were administered during the third week of training and again seven weeks later in 

the last week of training (hereafter, PFT1 and PFT2) . 

»Two physical fitness improvement scores were cojnp r uted. for each recruit. 

Fiest, the difference between the scores on the first and second test was com- 



puted for each recruit. Second, regression analysis "produced an equation to pre- 
diet PFT2 from PFl^l. Bach recruit's predicted value was then subtracted* from his 
actual final score to measure rssidualized gain (hereafter, gain) . 

Platoons were classified as high improvement if they were in the top 40% of. 
the platoons in their study based on (a) average difference, (b) average gain, (c) 
median difference, apd (d) median gain. Low improvement platoons were those in 

r 

..the. lowest 40% of the sample for all four criteria. The analyses were restricted 
to extreme groups to ensure clear-cut group differences in fitness improvement. 

Multiple criteria ensured tha': classification was based on unbiased estimates 
of improvement that were independent of initial fitness level and that were not 
influenced by one or two extreme scores in a platoon. Difference scores are un- 
biased estimates of improvement, but can be influenced by initial fitness level 
(8) • Gain scores are.* independent of initial fitness level, but are subject to 
bias when based on inappropriate assumptions about growth catves" (8,9) • The^ 
available data did not provide sufficient information to define growth curves 
precisely, >so : difference scores were used to ensure that the potential bias did 

not affeqt— the findings. Finally, average scores are sensitive to outliers, but 

« * 

median scores are not (1,0, pp. 61-62). tfhe combined criteria -therefore protected 

p 

against important sources of classification error when defining 'the fitness 

improvement groups. Details of a series of analyses supporting, the classi- 

# 

fication process are given in Appendix A. 
Esprit de Corps Measures 

Esprit de Corps was assessed by measuring attitudes toward t\\e Marine Corps, 
perceptions of leadership in BT, and perceptions of group cohesion in BT. Each of 
the measures represents an evaluation of the Marine Corps or perception of fellow 
Marines consistent with the generals-definition of Esprit given in BT manuals (1, 
p. 1-1) . Each attitude or perception was measured by a multi-item L^kert scale 

0 

taken from questionnaires employed in Studies 1 and 2 (2,11,12). 
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The four attitudes measured were: 



Affiliation: 



The extent to whicft. the regfruit identifies himself more 
with the Marine Corps than with the civilian population, 



Example Items; (a) I feel that. my outlook Is really more that of a 
' civilian than a Marine*. (Reverse -scored) *. 

(b) i f ' toy Commanding Officer offered me an honorable 
discharge right how, I would k not take it. 

* ' f " ^ 

Source: Booth and Hoiberg (13) . . - ^ 



Authority : The extent to, .which the recruit possesses' traditional 

views of authority including t ccoptance « of the necessity 
/ for and importance, of accepting Authority in the' Marine ' 

"Corps. v ' , 

Example Items: (a) The discipline you getJ in the Marine Corps is good 
. 9 . £or you. ■ • 

(p) A Marine should not be allowed to talk .back to his 
superiors. * 



S9urce: Booth *au>cL iToibe^g (13) . 



Commitment: 



The extent\to which the recruit feels thajt it is impor- 
tant to h impersonally to achieve and maintain a high 
level of performance in the Marine Corps. 



Example Items: i (a) It is important to me personally to have a good 

record in the Marine Corps. * j 

* * *(b) I don't 1 care how well I do in the Marine Corps. 
(Reverse scored) # 

Source: Drucker (14) • - 



Satisfaction: 



The extent to which the recruit holds a positive overall 
evaluation of the Marine Corps and perceives it as better 
than alternative occupations. 



Example Items: (a) All in al^L, I am very satisfied with being a Marine. 

(b) If .1 had my choice between joining the Navy, Army, 
Air Force, or Marines, I would still prefer to join 
the Marines. 

Source: Modified from Quinh and Shepard (15) • 



Th* leadership measures were four scales that were* present in both Study 1 
and Study 2.' Additional leadership scales 'available in , one study but not the 
other we« exclyded-trom the analyses to ensure that conclusions were based on 
reliable differences. The four scales present in both studies included the best 
•marker variables for the two leadership factors in our questionnaire (12), so the 
major BT leadership elements perceived by recruits were reflected despite the 
restriction to four scales. 

Le ader Structure :The extend to which Drill Instructors provided means-end 
structuring by detailing who was to do what and when. 

Example Items: (a) Our Drill Instructors told us exactly how to do 

things. 

.* (b) Drill Instructors told us why things had to be done. 

Leader Support: The extent to which Drill Instructors communicated a 

concern for the well-being of the recruits and a respect 

for the platoon. 

Example Items; (a) The Drill Instructors were interested to our welfare. 

(b) The Drill Instructors were proud of the platoon and 
the recruits in it. 

S 

Referen t Power : The extent to which Drill Instructors were regarded as 
! — setting a good example which t^e .ecruits want to copy. 

Example Items: (a) I would like to be like my U Instructors. 

(b) I respect my Drill Instructors as people. 

Expert Power : The extent to which Drill Instructors were expert and 

knowledgeable in their job.* 

Example Items: (a) My Drill Instructors are well-qualified for their \ 

jobs. \ 

(b) My Drill Instructors are very good at what they do. 
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Group cohesion was represented by two scales* 



/ 



Group Teamwork ; The extent to which recruits cooperated with one another 
and worked as a team to get necessary tasks done. • 

Example Items: (a) In our platoon people cooperated to get things done. 
* • * 

(b) Recruits stressed teamwork and team ^oals. 



Group Support : The extent to which Recruits in the platoon tried to make 
one another feel better when things were going bad and/or 
provided actual assistance on task3 that did not 
necessarily require teamwork. 

Example Items: (a) Recruits in the platoon trust one another. 

(b) Recruits in the platoon lent each other a hand when 
thing* got rough. 



Self-Improvement Scales 

Two multi-item Likert scales developed for Study 2 (2) measured self- 
imp;ovement: , 



Self-Esteem : The extent to which the recruit reports that basic 
training has improved him physically and mentally. 

Example Items: (a) I have more self-discipline than when I started 

training. 

(b) After going through boot camp, I believe I can do 
anything I set my mind to. 

Social Skills : The extent to which the recruit feels that basic training 
has increased his ability to get along with and work wit^h 
other types of people. 

Example Items: (a) As a result of training I've learned to get along 

with other people much better.' 

(b) In boot camp I've learned the importance of working 
together to get things done. 



Performance Measures 

Performance measures were scores from standard* tests routinely administered 
during BT. These included one academic test administered approximately *-2^1/2 
weeks into BT. and another academic test administered about 7 weeks later at the 
end of BT. The first test, Practical Examination 1, produced a single scor* with 
a maximum value of 100. For the secdRd test, Practical Examination 2, separate 
scores were recorded for the oral portion and the written portion. The maximum 



score for each poftioji of the second examination was 50. In addition, a total 
score for the second examination was computed by summing the two tests for com- 
parison to the first examination. 

Other performance measures were scores for rifle marksmanship qualification 

(i 16 Score, maximum possible score - 250), and Drill Instructor ratings 'of general 
acceptance of Marine Corps standards of behavior and philosophy (Conduct) and of 
general performance on both tests and routine day-to-day performance during BT 

(Senior. Drill Instructor Subjective Evaluations* hereafter, Subjective 
Evaluations) . The ratings, were made at thjp end of BT; the maximum possible score 
for each rating was 5.0. Detailed descriptions of each performance measure can be 
found in the manual of standard operating procedures for training male Marine 
Corps recruits current when these studies were executed (1) . 

Health Indicators 

Health wa*i measured by number of illness incidents during BT. Illness inci- 
dents were identified from' health care records kept at' the Recruit Training Depot 
Dispensary. An illness incident was defined -\s a visit or series of visits for a 
particular set of presenting complaints with no more than 7 days separating two 
consecutive visits. If more than 7 days separated two consecutive visits, the 
^-visits were scored as separate illness incidents unless there was a specific note 
that the problem was a continuation of a previous incident. 

The diagnosis was recorded for each illness incident. After reviewing the 
frequency of different types of incidents, this study employed 5 health measures: 
(a) Total number of incidents, (b) Number of upper respiratory/viral incidents, 
(c) Number of trauma/injury incidents, (d) Number of "other" health problems, 
and (e) Number of sprain/strain incidents. Sprains/strains were part of tho 
trauma/injury category, but were singled out for special, attention because this 
type of problem seemed particularly likely to be affected by PT demands; 

FMF Success 

Success in the Fleet Mar ine Force (FMF) was scored as follows : 1 = 
Discharged for behavioral reasons? 2 = Below average rank (E-l or E-2) ; 3 » 
Average rank (E-3) ; 4 = Above average rank (E-4 or E-5) . Men discharged upon 
completion of active duty requirements (e.g.., reservists), for medical reasons, or 
for miscellaneous nonbehavioral problems .were excluded from the FMF success 

/ 
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analyses because their "follow-up status was an ambiguous indicator of FMF per- 
formance* Data were obtained from computerized records at Headquarters, Marine 
Corps, 3 to 3-1/2 years after entering BT# 

Analysis Procedures 

The t-test was used to compare the low and high improvement groups on atti- 
tude, leadership, group cohesion, sel^ improvement, and performance measures. 
Chi-square tests of association assessed relationships between fitness improvement 
and health, attrition from BT, and FMF success. Where appropriate, the binomial 
test was used to test the hypothesis that specific events (e.g. , behavioral 
attrition) were randomly distributed between the low and high improvement groups 
for specific outcomes (e.g., the number of behavioral attrites in each group) (16, 
pp. 580-586). 

A result was . significant if: (a) Group differences were in the same direc- 
tion in each study. (b) The difference was significant at the p < .05 level in at 
least one study and at the p < .25 level in each other stqdy. (c) The pooled 
significance across studies was p < (.05 / number of variables used to test the 
hypothesis) • These criteria ensured consistent trends across studies, allowed for 
the fact that even significant associations will produce some nonsignificant find- 
ings by chance (17) , anc 1 kept experiment-wide error probability to 5% or less for 
each hypothesis (18). The methods of adding logarithms and adding probabilities 
were used to estimate pooled significance (19) . 



RESULTS 

Hypothesis It High fitness improvement will be asso ciated with greater Esprit de 
Corps . 

Hypothesis 1 was supported for attitudes (see Table 2) . Each of the 8 
attitude comparisons showed more positive attitudes in the high improvement group. 
The differences met the pooled significance criterion for acceptance of authority 
and satisfaction with the Marine Corps (pooled significance p < .001 for both). 

The speculative extension of Hypothesis 1 to include leadership and group 
cohesion (p. 2) was supported only by the Referent Power results (see Table 2). 
The high improvement group consistently saw their Drill Instructors as better 
examples of what a Marine should be (pooled significance, p < .001). 

• N • . . . • 



TABLE 2 

COMPARISON OF ESPRIT DE CORPS INDICATORS 
FOR THE LOW AND HIGH FITNESS-IMPROVEMENT GROUPS 







STUDY 1 






STUDY 2 






LOW 


HIGH 


t 


SIG.° 6 


LOW 


HIGH 


t 


SIG.° 


ATTITUDES 
Affiliation 
Authority 
Commitment 
Satisfaction 


c 

3.61 
4.67 
5.65 
4 4.99 


3.96 
5.07 
5.72 
5.33 


1.94 
2.95 
0.89 
2.62 


.027 
.002 
.187 
.005 


4.1? . 
4.83 
5.67 
4.69 


4.22 
5.04 
5.78 
4.93 


0.19 
2.02 
1.56 
1.79 


.423 
.022 
.061 
.037 


" . LEADERSHIP 

Leader Structure 
Leader Support 
Expert Power 
Referent Power 


4.27 
4.11 ~ 
5.44 
3.79 


4.74 
4.65 
5.71 
4.44 


3.57 
4.03 
3.00 
4.11 


.001 
.001 
.002 
.000 


4.57 
3.91 
5.52 
4.55 


4.25 
3.87 
5.56 
4.91 


-2.37 
-0.21 
0.53 
2.77 


1.010)? 
(.419) 6 

.298 

.003 


GROUP COHESION 
Group Teamwork 
Group Support 


3.65 
2.68 


4.02 
3.41 


2.35 
5.49 


.010 
.001 


3.73 
2.98 


3.49 
2.93 


-1.46 
-0.39 


(.071)* 
(.350) 6 



<;The indicated significance is one^tailed because there was a directional hypothesis for these analyses. 

''Group moans m this analysis were contrary to the prediction that the high fitness improvement group would have a higher mean. 




Hypothesis 2; High fitness improvement will be associated with greater perceived 
- giclf- improvement in recruits . 
Hypothesis 2 was tested only in the second study. The hypothesis *as sup- 
ported for Self-Esteem, but not Social Skills (see Table 3) , Informal replication 
of the Self-Esteem finding was provided by "our earlier interviews with recruits, 
but it must be kept in mind that this finding has not been formally replicated. 
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TABLE 3 

COMPARISON OF SELF-IMPROVEMENT INDICATORS 
FOR THE LOW AND HIGH FITNESS IMPROVEMENT GROUPS 







IMPROVEMENT GROUP:.. 
LOW HIGH. 


1 !■ I^B^=^^= 
t 


SIG.° 


^If-Esteem 




5.41 


5.66 


2.76 


.003 


Social Skills 


•> 


5.07 


5.06 


.-o.il 


(.455) fc 



NOTE} The selNmprovement indicators were only included in Study 2. \ 
tf The indicated significance is one-tailed because there was a directional hypothesis for these analyses. 

^Group means in this analysis were contrary to the prediction that the high fitness improvement group would have a higher mean. 



Hypothesis 3: Fitness improvement level will be related to BT ^performance > 

The performance hypothesis did not specify the direction of the association 
because both positive and negative effects were plausible. A general trend toward 
better performance in the high improvement group was indicated by a higher score 
in 14 of 15 comparisons (x 2 = 11.27, 1 df, p < .001). Despite this consistency, 
only the oral and total test scores for the second academic examinations were 
significantly different using the criteria established for the study (see Table 4, 
p. 12) . The pooled significance was p < .001 for both. 

Hypothesis 4: High fitness improvement will be associated with poorer 
health in BT , 

, Data from the second and third studies failed to support Hypothesis 4 even 
when analysis was restricted to sprain/strain diagnoses. However, *a trend toward 
higher medical attrition in the high improvement group (see below) provided some 
support for this hypothesis. 
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TABLE 4 



Practical Exam 1 

M16 Score 

Practical Exam 2: 
' Oral 
Written 

Total 



COMPARISON OF HIGH AND LOW IMPROVEMENT GROUPS 
ON TRAINING PERFORMANCE MEASURES 



STUDY 1 
LOW HIGH t SIG. 



b8.0 89.0 0.93 .351 
202.4 202.2 -0.11 .914 



47.4 48.8 
45.4 45.5 

92.7 94.3 



3.94 
0.20 

2.17 



.001 
.843 

.031 



STUDY 2 
LOW HIGH t 



SIG. 



87.1 90.8 8.92 .001 
201.8 203.7 2.53 .011 

48.3 48.7 3.46 .001 

47.2 47.8 3.34 .001 

95.5 96.5 4.89 .001 



LQV-; 



STUDY 3 
HIGH t 



SIG. 



87.9 88.1 0.54 .592 

203.8 204.3 0.58 .563 

45.1 46.3 5.56 001 

44.8 46.5 9.23 .001 

89.9 92.8 9.75 .001 



NOTE: Significance levels are 2-tailed because there was no directional prediction for the effects. 



Hypothesis 5: Attrition during basic training will be higher in the high 
improvement group . 

Hypothesis 5 was supported using the binomial probability model to test the 
hypothesis that the distribution of attrites between the low and high improvement 
groups was equal to that expected by chance. The average overall attrition rate 
in the low improvement groups was 10.0%; the average for the high improvement 
groups was 14.5% (see Table 5, pg. 13). Thus, attrition in the high fitness 
improvement group was 45% higher than in the low improvement platoons. The trend 
toward excess of attrition in the high improvement group was highly significant in 
Study 3 (p : .001, one-tailed) , but only marginally significant in Study 2 (p <* 
.072, one-tailed) . The pooled significance was p < .003, so the overall results 
clearly supported the hypothesis. . 

Closer examination of the attrition trends showed that the tendency for ..the 
high improvement group to have excess medical attrition was more stable than the 
tendency to have exfless behavioral attrition. Applying the binomial test to the 
medical attrition data showed a significant difference in Study 2 (p < .044) with 
a marginally significant difference in Study 3 (p < .112) (pooled significance, p 
< .013) . Applying the same test to the behavioral attrition data produced p < 
.283 for Study 2 and p < .001 for Study 3 (pooled significance, p < .041). 
Employing the significance criteria established for this report, the behavioral 
attrition trend was nonsignificant because .the Study 2 probability was p > .25. 

-12- 
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TABLE S 

PLATOON FITNESS IMPROVEMENT LEVEL AND BASIC TRAINING ATTRITION 



ATTRITION STATUS 



STUDY , IMPROVEMENT LEVEL BEHAVIORAL MEDICAL . GRADUATE 



2 


Low 


5.5% • 


4.7% 


89.7% 






High 


12.4% 


4.6% 


83.1% 




3 


Low 


6.0% 


3.7% 


90.3% 






' High 


6.7% 


5.4% 


88.0% 





NO T£; tercentoges do not sum to 100 due to rounding error. 



Hypothesis 6: Graduates from the high improvement group will have greater FMF 
success than those from the low improvement group ., 

yhese analyses were limited to Studies 2 and 3 because of the small sample in 

* 

Stugy 1* There were no significant differences in likelihood of discharge or rate 
of advancement in the FMF. 



DISCUSSION AMD CONCLUSIONS 

Evidently, PT can have side effects during BT as indicated by findings sup- 
porting 5 of the 6 hypotheses tested. However, three major issues must be 
considered before this observation can be translated into guidelines for the 
design of PT programs, yhese issues are the mixed positive and negative character 
of the side effects, the direction of causality for the side effects, and the size 
of the side effects. • 

The mixed character of the side effects is evident in the fact that the high 
improvement groups combined better attitudes toward the Marine Corps and leader- 




ship, greater perceived self-improvement, and better academic performance with 
higher attrition* If the added attrition in the high improvement group represents 
recruits who could have' performed satisfactorily in the FMF, the increased 
attrition is a negative side effect. However,' if the added, attrition reflects 
early identification of recruits who cannot adapt to the the Marine Corps' 
challenge, the increased attrition would be a positive side effect (1, p. 2-1). 

Although the ambiguity of the attrition findings cannot be resolved at this 
time, several points connected with the attrition results are important for future 
consideration. First, if the additional attrition represents poor performers or 
individuals with bad 'attitudes, the added N <*ttr ition would improve performance and 
attitudes. If so, the attrition differences could explain the other differences 
between the low and high fitness improvement groups. Under these circumstances, 
the added attrition would be the key side .effect for PT because it would explain 
the other side effects. Second, even if attrition *is positive from the organiza- 
tional perspective, it will be a negative event for most of the individuals dis- 
charged. Finally, if high attrition is a negative side effect of high improvement 
PT programs, low attrition must be regarded as a positive side effect of low 
improvement PT programs. 

The second issue for future consideration is the direction of causality for 
the associations described in this report. These associations have., been inter- 
preted htere as indicating that PT caused nonfitness BT outcomes. This focus has 
been chonen to emphasise the possibility of changing nonfitness BT outcomes by 
altering PT programs. Oth*- . plausible interpretations are possible. For example, 
positive attitudes at the beginning of BT might contribute to greater effort 
during training which then produces "larger improvements in physical fitness. If 
PT is not a cause of the nonfitness BT outcomes, changing PT programs will not 
enhance the effects of BT. Therefore, alternative explanations should be devel- 
oped and tested now that it has been established that significant associations 
exist which require explanation. 

Longitudinal field studies, particularly field experiments, vfould be required 
to te3 alternative causal hypotheses. Such studies would make it possible to 
determine whether pre-existing recruit characteristics (e.g., positive attitudes) 
determine fitnesV improvement, whether fitness improvement determines how atti- 
tudes change, or whether both assertions are true. The results of such studies 
would be of value no^^attor what the outcome, because even the alternative 
explanations, would help understand the dynamics producing BT outcomes. 

\ -14- 



The third issue for further study is whether the side effect? of PT are large 
enough to have practical importance* The differences between the low improvement 
and high improvement groups were modest in the studies reported here, but method- 
ological factors way have produced this trend. One such factor is the use of 
insensitive measures for two key dependent variables. Many recruits scored at or 
near the maximum for several of the attitude and academic performance measures. 
This indicates that the measures were insensitive to true differences in attitudes 
and • knowledge ° that occurred in ranges above the upper limit of the scale (2a) . 
Because the data trends implied that the high improvement group was higher on 
these measures, this restriction would affect this group more than the low im- 
provement group. If so, the observed differences underestimated true differences 
because the high improvement group could not demonstrate its superiority. The 
development of outcome measures sensitive to the higher ranges of attitudes aod 
knowledge would help resolve this issue.' 

A second methodological factor that may have affected the size of the associ- 
ations was the use of an overall fitness measure to classify groups. The problems 
associated with the use oi an overall improvement measure have important implica- 
tions in connection with the possibility of designing PT programs to promote the 
full range of BT objiccives. Therefore, this point will be developed in some 
detail • 

The use of an overall fitness improvement measure to classify platoons means 
that platoons with similar overall improvement can differ in important ways. For 

example, one high improvement platoon maySjiay* nmphasized strength training while 

* 

another -high improvement platoon emphasized cardiovascular endurance training. 
Other differences could arise if different methods were used to achieve identical 
fitness gains. For example, one platoon could hav. employed 1 infrequent, but high 
intensity, exercise wh.lie another platoon exercised more f recently at a lower 
intensity. 

Platoon differences such as those just described are important if PT side 
effects depend on the particular type of physical improvement achieved and/or the 
jspecific methods employed to produce physical improvement. Extending the second 
example given above, positive attitude changes may depend on mastering high inten- 
sity xercise demands. If so, attitude effects such as those noted in this s.udy 
would be limited to platoons which emphasized exercise intensity rather than exer- 
cising frequently. Analyses comparing groups defined on the basis of overall 




Improvement will water down the side effects of intense PT by' combining high 
intensity platoons with the low intensity platoons which achieved sitailar fitness 
gains but did not produce th? attitudinal side effects. 

The above example could be extended to many combinations of type, intensity, 
and duration of exercise, type and amount of fitness improvement, and type of 
nonfitness BT outcome. The range of possible combinations leads to the conclusion 
that more detailed assessments of actual patterns of exercise and specific aspects 
of fitness improvement are needed to accurately estimate the potential for using 
PT to promote nonfitness BT outcomes. The resulting knowledge would provide a 
basisXfor . accentuating positive side effects while eliminating negative side 
effects if the two types of effect depend on different PT elements. 

To summarize briefly, the results reported here supported the possibility 
that PT can affect nonfitness BT outcomes. Having verified this point, more de- 
tailed studies of the relationships between specific PT elements and particular 
nonfitness BT outcomes are needed to determine whether elements of PT programs 
really cause differences in nonfitness BT outcomes, to determine whether the side 
effects are large enough to have practical importance, and, if so, to isolate the 
critical elements of PT for producing positive side effects while avoiding nega- 
tive side effects. 0 The additional research could provide guidelines for designing 
PT programs to maximize overall BT outcomes. 
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APPENDIX A 



DEFINING FITNESS IMPROVEMENT GROUPS 
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Testing the ^research hypotheses required that, platoons be qharabterized in 

_ ■ 

terms of fitness improvement during BT. It was .therefore necessary to estimate 
change^ in fitness from early in BT to the end of BT • Becauee change measurement 
4 Is ^ complex statistical problem, a series of analyses were 1 coroleted to ensure 
that fitness improvement was measured properly and that the resulting platoon 
classifications provided legitimate tests of the. hypotheses described in the 
introduction. * - 

Sample Comparisons . Initial analyses compared the PPT > score distributions 
for the three samples to deter mihe whether the samples could be regarded as de- 
riving from a single population. If not, the three studies could not reasonably 
be treated as replications of^one anott£k. Results showed: C 



(a) The average PPT scores varied c substantially for both tests, but 
variability around the means was stable across studies (see Table A- 
1). 

(b) The average difference between PFT2 and PFT1 ranged from 41.28 in 
Study' 3 to 48.02 in Study 2. The variation was statistically sig- 
nificant (P » 10.00, p < .001), but accounted for only 0.5% of the 
overall variation in PFT2-PFT1 differences. Also, the*7-point range 
for differences was substantially smaller than the 21 point range 
for PPT1 a J the 17-point range for PFT2. Thus, amount of change 
was roughly comparable over studies. 

- 

(c) The regression lines predicting PPT2 from PPT1 were highly similar 
across studies (see Table A-l) . Each regression coefficient fell 
within the 50% confidence intervals for the other two studies* 



TABLE A-l 
PHYSICAL FITNESS TEST DATA 



PFT1 



PFT2 



REGRESSION COEFFICIENTS 



COHORT 


MEAN 


S. D. 


MEAN 


S. D. 


T 12 


b 


a 


Study 1 


173.50 


41.35 


219.99 


.38.17 


.618 


.570 


121.46 ' 


Study 2 


187.16 


43.75 


235.14 


38.18 


.687 


•699 


123.00 


Study 3 


194.79 


.43.91 


236.07 


35.45 


.720 


.581 


122.80 



NO TE: The regression coefficients produce predictions ofPFT2 based on PFTi as foiiows: Predicted PFT2 *a + b(PFT1 / 



0 



I 



0 

ERIC 



The results for this initial analysis of the fitness scores indicated sufficient 
similarity to treat the samples as replications. This conclusion was appropriate 
given the emphasis on fitness improvement. The sample variations in PFT2-PFTl_ 
differences and the regression lines required for computing residualized gains did 

He 

'not appear sufficiently large to regard the samples as noncomparable. 

Platoon Differences in Fitness Improvement . The next set of analyses con- 

firmed that fitness improvement varied significantly 'across BT platoons. ' This 

confirmation was a logical requirement for asserting that platoons truly had 

different PT programs. Comparing programs which differed by only chance amounts 

woul« not provide a meaningful test of our hypotheses. Analyses showed: 

(a) Fitness improvements varied significantly across P^toons. This was 
> .true of both the simple difference between PFT2 and PFTl and the 
residualized gain score described in the Method section (see Table 
No te that the platoon differences in the two improvement 
measures were much more substantial than the platoon differences in 
either iniSIl fitness level or final fitness level , as indica e y 
the larger eta coefficient. Note also that the significant dif- 
ferences for the gain scores reduces the likelihood that the "suits 
derived from statistical artifacts arising from the presence of PFTl 
differences (e.g., regression to the mean). 



TABLE A-2 . 

RESULTS OF ONE-WAY ANALYSES OF VARIANCE 
COMPARING INDIVIDUAL PLATOONS 
ON INITIAL FITNESS, FINAL FITNESS, AND FITNESS IMPROVEMENT MEASURES 





eta 


PFTl . 
F 


SIG. 


eta 


PFT2 
F 


SIG. 


eta 


GAIN 
F 


SIG. 


DIFFERENCE • 
eta F SIG. 


Study 1 


.33 


1.39 


.106 


.43 


2.69 


l .001 


/^1 


4.18 


.001 


.48 3.49 .001 


Study 2 


.28 


3.52 


.000 


.36 


6.49 


.000 


.47 


12.08 


.000 


.45 10.61 .001 


Study 3 


.33 


2.75 


.000 


.43 


2.05 


.000 


.51 


6.12 


.000 


.36 " 7.29 .001 



NOTE. Set Method wtion for description of the number of platoons and recruits in each study. 
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(b) The fitness improvement measures were highly correlated across 
platoons (see Table A-3) . Thus, classifications could be developed 
which minimised important problems in the assessment of change by 
combining the four potent:*! classification criteria (see pp. 3-4) . 



TABLE A-3 

RANK ORDER CORRELATIONS BETWEEN DIFFERENT POSSIBLE CRITERIA 
FOR ESTABLISHING PHYSICAL TRAINING INTENSITY LEVELS 



Study 1 ( n - 26) 
•Mean for Differences 
Median for Difference! 
Mean for Gain 
Median for Gain 

Study 2 (n- 40) 
Mean for Differences •• 
Median for Differences * 
Mean for Gain 
Median for Gam 

+. 

\ 

Study 3 (n- 37) 
Mean for Difference 
Median for Differences 
Mean for Gain 
Median for Gain 



Tests of Regression Assumptions for Gain Scores , Having established that 

individual platoons differed sufficiently tp provide a reasonable basis for 

defining high and low improvement groups $ the platoons in each of the three 

samples were divided into (a) those falling in the upper 40% of both the 

difference and gain averages, (b) those falling in the lower 40% on both, and (c) 

all other platoon„s. Further analyses then considered several possibilities which 

might have affected the validity of the gain scores as bases for classification: 

(a) Gain scores accurately: estimated group differences only if the 
regression lines relating PPT 2 to PPTl were comparable within 
groups (21, pp. 40-50). Analysis of covariance showed non- 
parallel regression lines for the first two studies, but not 
the third (see Table A-4) . The statistically significant dif- 
ferences were small. Over the three studies, an average of 45% 
of the variation in PPT2 scores was predictable from the PPTl 
scores based on the overall sample regression line. Replacing 
the overall sample regression line with 3 within group regres- 



.91 

*t95 .90 
.90 .93 .95 



r 



.97 
.91 
.92 



.88 
.89 



.98 



.97 
.93 
.89 



.91 
.89 



.96 



sion lines in each study would 4jaye predicted an average of 
0.5% more of the PPT2 .variance. Thus, the similarity across 
improvement levels was much more Substantial than the 
differences. 



TABLE A-4 

COMPARISON OF REGRESSION LINES 
FOR DIFFERENT FITNESS IMPROVEMENT LEVELS 



TEST FOR NONPARALLELISM: WITHIN GROUP REGRESSION SLOPES: 

F SIG. eta*» ~ 



LOW 


MEDIUM 


HIGH 


.850 


.628 


.527 


ssa 
t«PO 


.637 


' .567 


.688 


.628 


.598 



Study 1 6.35 .002 .019 

Study 2 v 4.53 .011 .008 

Study 3 .1.57 .208 .001 



•ete J i$ the proportion of total verlence explained by the dlfforencei between the within jroup «flre»lon linei. 



(b) The small within group differences in regression slopes that 
were observed probably arose from having an upper limit foi 
fitness test scores. Examination of the scatterplots for the 
high intensity groups indicated that more recruits in this 
group wer.e at or near the 300-point maximum for the test. This 
fact could mean that the PPT 2 scores for this group were af- 
fected more than the other groups by a restriction of range 
which would effectively reduce the magnitude of the regression 
slope (22, pp. 140-141). 

(c) When measuring change, controls must be introduced to adjust 
for the effects of variables which might bias change estimates 
(8,9). Bias would arise if there were variables which were (i) 
correlated with fitness improvement and (ii) nonrandomly dis- 
tributed across the fitness improvement groups. Exploratory 
analyses showed that no sociodemographic variables were signi- 
ficant sources of bias in the fitness improvement estimates. 
Although high improvement platoons consistently had a below 
average proportion of whites, the correlation between race and 
fitness improvement was minimal. For the gain scores, the 
weighted average correlation adjusting for study differences in 
sample size was r - .03 (range: r - -.11 to r - .06). The 
comparable figures for difference scores were r - J- 0 * <5 an jJ e ; 
r - -.11 to r - .00). No other sociodemographic Variable was 
nonrandomly distributed across the fitness improvement groups 
in each of the studies. 
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Effectiveness of Group Classification , A Jinal set of analyses examined the 
clarity and consistency of discrimination between h.J.gh and low intensity platoons. 
Especially in an initial study such as the present, it is desirable that the 
groups being compared differ widely on the classification variable(s). Without 
substantial differences, comparisons between the so-called "high improvement " and 
"low improvement 91 groups would be meaningless. Although the classification 
procedures employed for defining the high improvement and low improvement groups 
guaranteed at least some difference, it was of interest to determine how 
substantial the differences vfere. Results were: 



(a) In all three samples, the high and low improvement groups dif- 
fered c substantially (p < .001) on both the difference scores 
and the gain scores. The gain score differences confirmed that 
the high improvement group exceeded the low improvement group 
taking any initial group fitness differences into account. 



(b) Pairwise comparisons between individual high intensity pla- 
toons and individual low intensity platoons were made using the 
least significant difference test for multiple comparisons with 
a modification to adjust for differences in group sizes {23). 
The significance criterion for pairwise comparisons was based 
on a 10% significance level for the overall set of com- 
parisons. The proportion of pairwise comparisons between high 
* -and low intensity platoons exceeding the criterion value were: 

\ (i) Study 1, .84.71 for gain scores; 18.1% foj: difference 
scores. (ii) Study 2, 96.5% for gain comparisons; 97.2%* for 
difference scores. (iii) Study 3, 95.1% for gain scores; 94.4% 
for difference scores. 



These findings showed good discrimination between the high and low im- 
provement groups except in Study 1. However, the typical difference between 
platoons in Study 1 was * roughly as large as that in the other two studies. 
Therefore, the * smaller size of the samples representing each platoon was the 
reason for tjhe lower frequency of significant between platoon differences (24) • 
Overall, the discrimination between high improvement and low improvement platoons 
was adequate to test our hypotheses. 
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Pr °Tnis S study capitalized on naturally Occurring platoon differences in fitness 
outcomes to classify BT platoons into relatively low and high improvement 
groups. The high improvement group had better attitudes toward the Marine 
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20, Abstract (continued) 

CP 

Corps, greater perceived self -improvement, and performed better during BT. The 
groups had comparable illness cates during BT, but there was more medical 
attrition and overall attrition in the high improvement group. The groups did 
not differ with regard to Fleet Marine Force promotion or attrition rates. 
These findings demonstrate that PT programs may indeed influence noh- 
fitness BT outcomes. Additional research is heeded to verify that PT differ- 
ences actually cause nonfitness BT outcomes, to determine whether these side 
effects are large enough to be of practical importance, and to identify 
specific aspects of PT programs that are important for promoting nonfitness BT 
objectives. 
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